Figure 1A

1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60

61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120

121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180

181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240

241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300
1 MARAHW GCCPWL 12

301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360
13 VLLC ACA WGH TKPL DLG GQD 32

361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420
33 VRNC STN PPY LPVT VVN TTM 52

421 TCACTCACAG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480
53 SLTA LRQ QMQ TQKL SAY IIP 72

481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG ACGAGAGGCG TGCGTGGATT 540
73 GTDA HMN EYI GQHD ERR AWI 92

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600
93 TGFT GSA GTA VVTM KKA AVW 112

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660
113 TDSR YWT QAE RQMD CNW ELH 132

661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC CGCTGGAGGG 720
133 KEVG TTP IVT WLLT EIP AGG 152

721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780
153 RVGF DPF LLS IDTW ESY DLA 172

781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840
173 LQGS NRQ LVS ITTN LVD LVW 192

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900
193 GSER PPV PMQ PIYA LQE AFT 212



901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 
213 GSTW QEK VSG VRSQ MQK HQK 232 
Figure 1B



961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020
233 VPTA VLL SAL EETA WLF NLR 252



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
253 ASDI PYN PFF YSYT LLT DSS 272 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 IRLF ANK SRF SSET LSY LNS 292 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 1200
293 SCTG PMC VQI EDYS QVR DSI 312



1201 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 1260
313 QAYS LGD VRI WIGT SYT MYG 332

1261 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 1320
333 IYEM IPR EKL VTDT YSP VMM 352

1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380
353 TKAV KNS KEQ ALLK ASH VRD 372

1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440
373 AVAV IRY LVW LEKN VPK GTV 392

1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500
393 DEFS GAE IVD KFRG EEQ FSS 412

1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560
413 GPSF ETI SAS GLNA ALA HYS 432

1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620
433 PTKE LNR KLS SDEM YLL DSG 452 



1621 GGGCAGTACT GGGACGGGAC CACAGACATC ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 
453 GQYW DGT TDI TRTV HWG TPS 472 



1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 
473 AFQK EAY TRV LIGN IDL SRL 492 

1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800
493 IFPA ATS GRM VEAF ARR ALW 512 

1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 
1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980
553 SIEP GYY KDG EFGI RLE DVA 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2040 
573 LVVE AKT KYP GELP DLV VSF 592

2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCCGAGCA TCTCCAGTAC 2100
593 VPYD RNL IDV SLLS PEH LQY 612

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160
613 LNRY YQT IRE KVGP ELQ RRQ 632

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220
633 LLEE FEW LQQ HTEP LAA RAP 652

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 2280
653 DTAS WAS VLV VSTL AIL GWS 672

2281 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2340 
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 

2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 

2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520

2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 2580

2581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640

2641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700

2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760

2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820

2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880

2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940
Figure 1D

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060 
3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360
3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420
3421 GCCTGTGG 3428 
Figure 2A



1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60 

61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120

121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180 

181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300
1 MARAHW GCCPWL 12

301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360
13 VLLC ACA WGH TKPL DLG GQD 32

361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420
33 VRNC STN PPY LPVT VVN TTM 52

421 TCACTCACAG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480
53 SLTA LRQ QMQ TQNL SAY IIP 72

481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG ACGAGAGGCG TGCGTGGATT 540
73 GTDA HMN EYI GQHD ERR AWI 92

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600
93 TGFT GSA GTA VVTM KKA AVW 112

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660
113 TDSR YWT QAE RQMD CNW ELH 132

661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC CGCTGGAGGG 720
133 KEVG TTP IVT WLLT EIP AGG 152

721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780
153 RVGF DPF LLS IDTW ESY DLA 172

781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840
173 LQGS NRQ LVS ITTN LVD LVW 192

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900
193 GSER PPV PNQ PIYA LQE AFT 212

901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960
213 GSTW QEK VSG VRSQ MQK HQK 232
Figure 2B

961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020
233 VPTA VLL SAL EETA WLF NLR 252 

1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080
253 ASDI PYN PFF YSYT LLT DSS 272 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 IRLF ANK SRF SSET LSY LNS 292 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 1200 
293 SCTG PMC VQI EDYS QVR DSI 312 



1201 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 1260
313 QAYS LGD VRI WIGT SYT MYG 332

1261 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 1320
333 IYEM IPR EKL VTDT YSP VMM 352

1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380
353 TKAV KNS KEQ ALLK ASH VRD 372

1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440
373 AVAV IRY LVW LEKN VPK GTV 392 



1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500 
393 DEFS GAE IVD KFRG EEQ FSS 412 

1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560
413 GPSF ETI SAS GLNA ALA HYS 432 

1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620
433 PTKE LWR KLS SDEM YLL DSG 452 



1621 GGGCAGTACT GGGACGGGAC CACAGACATC ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680
453 GQYW DGT TDI TRTV HWG TPS 472 



1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 
473 AFQK EAY TRV LIGN IDL SRL 492 

1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800
493 IFPA ATS GRM VEAF ARR ALW 512 

1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 
1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980 
553 SIEP GYY KDG EFGI RLE DVA 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2040
573 LVVE AKT KYP GELP DLV VSF 592

2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCGGAGCA TCTCCAGTAC 2100
593 VPYD RNL IDV SLLS PEH LQY 612

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160
613 LNRY YQT IRE KVGP ELQ RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220 
633 LLEE FEW LQQ HTEP LAA RAP 652 

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 2280 
653 DTAS WAS VLV VSTL AIL GWS 672 

2281 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2340
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 

2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 

2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520

2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 2580

2581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640

2641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700

2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760

2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820

2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880

2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940
Figure 2D

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060
3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360
3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420
3421 GCCTGTGG 3428 
Figure 3A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL ELQS SNQ SQL 18 



61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 

19 FPQN ATA CDN APEA WDL LHR 38

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180
39 VLPT FII SIC FFGL LGN LFV 58

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240
59 LLVF LLP RRQ LNVA EIY LAN 78

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138 



421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139 LVHP MAS GRQ QRRR QAR VTC 158 



481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159 VLIW VVG GLL SIPT FLL RSI 178 



541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600
179 QAVP DLN ITA CILL LPH EAW 198 



601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660
199 HFAR IVE LNI LGFL LPL AAI 218

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720
219 VFFK YHI LAS LRTR EEV SRT 238 



721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 RVRG PKD SKT TALI LTL VVA 258 



781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840
259 FLVC WAP YHF FAFL EFL FQV 278 



841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 QAVR GCF WED FIDL GLQ LAN 298 
Figure 3B



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE IFQ LFWR N * 353



1081 CC 1082 
Figure 4A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120
19 FPQN ATA CDN APEA WDL LHR 38

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180
39 VLPT FII SIC FFGL LGN LFV 58

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240
59 LLVF LLP RRQ LNVA EIY LAN 78

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138 



421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139 LVHP MAS GRQ QRRR QAR VTC 158 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159 VLIW VVG GLL SIPT FLL RSI 178

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600
179 QAVP DLN ITA CILL LPH EAW 198

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660
199 HFAR IVE LNI LGFL LPL AAI 218



661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219 VFFN YHI LAS LRTR EEV SRT 238 



721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780
239 RVRG PKD SKT TALI LTL VVA 258 



781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 

259 FLVC WAP YHF FAFL EFL FQV 278 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 

279 QAVR GCF WED FIDL GLQ LAN 298 
901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCAGCTC 960

299 FFAF TNS SLN PVIY VFV GQL 318

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020

319 FRTK VWE LYK QCTP KSL API 338 



1081 CC 1082 
Figure 5A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60
1 MA SSW PPL ELQS SNQ SQL 18

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120
19 FPQN ATA CDN APEA WDL LHR 38

121 GTGCTGCCAA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180
39 VLPT FII SIC FFGL LGN LFV 58

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240
59 LLVF LLP RRQ LNVA EIY LAN 78

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480
139 LVHP MAS GRQ QRRR QAR VTC 158

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540
159 VLIW VVG GLL SIPT FLL RSI 178

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600
179 QAVP DLN ITA CILL LPH EAW 198

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660
199 HFAR IVE LNI LGFL LPL AAI 218

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720
219 VFFN YHI LAS LRTR EEV SRT 238

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780
239 RVRG PKD SKT TALI LTL VVA 258

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840
259 FLVC WAP YHF FAFL EFL FQV 278

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900
279 QAVR GCF WED FIDL GLQ LAN 298
Figure 5B



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080
339 SSSH RKE IFQ LFWR N * 353 



1081 CC 1082 
Figure 6A

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60 
1 MFSP WKI SMF LSVR EDS VPT 20

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120
21 TASF SAD MLN VTLQ GPT LNG 40

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180
41 TFAQ SKC PQV EWLG WLN TIQ 60

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240
61 PPFL WVL FVL ATLE NIF VLS 80

241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300
81 VFCL HKS SCT VAEI YLG NLA 100

301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360
101 AADL ILA CGL PFWA ITI SNN 120

361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420
121 FDWL FGE TLC RVVN AII SMN 140



421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 

141 LYSS ICF LML VSID RYL ALV 160 

481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 

161 KTMS MGR MRG VRWA KLY SLV 180 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600
181 IWGC TLL LSS PMLV FRT MKE 200

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660
201 YSDE GHN VTA CVIS YPS LIW 220

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720
221 EVFT NML LNV VGFL LPL SVI 240

721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780
241 TFCT MQI MQV LRNN EMQ KFK 260

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840
261 EIQT ERR ATV LVLV VLL LFI 280

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900
281 ICWL PFQ IST FLDT LHR LGI 300
Figure 6B

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 

301 LSSC QDE RII DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020
321 AYSN SCL NPL VYVI VGK RFR 340

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080 

341 KKSW EVY QGV CQKG GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 

361 IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200

381 HKLQ DWA GSR Q* 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800

1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 
Figure 6C

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240
Figure 6D

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300

3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360

3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420

3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480

3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540

3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600

3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660

3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720
3721 AAAAAAAAAA AAA 3733 
Figure 7A

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120



121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
1 MDNVLPVDSD10 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300
11 LSPN IST NTS EPNQ FVQ PAW 30

301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360
31 QIVL WAA AYT VIVV TSV VGN 50

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420
51 VVVM WII LAH KRMR TVT NYF 70

421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480
71 LVNL AFA EAS MAAF NTV VNF 90

481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540
91 TYAV HNE WYY GLFY CKF HNF 110

541 TTTCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600
111 FPIA AVF ASI YSMT AVA FDR 130

601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660
131 YMAI IHP LQP RLSA TAT KVV 150

661 ATCTGTGTCA TCTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720

151 ICVI WVL ALL LAFP QGY YST 170 

721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780 

171 TETM PSR VVC MIEW PEH PNK 190 



781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 
191 IYEK VYH ICV TVLI YFL PLL 210 



841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900 
211 VIGY AYT VVG ITLW ASE IPG 230 



901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960 
231 DSSD RYH EQV SAKR KVV KMM 250 
Figure 7B

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 

251 IVVV CTF AIC WLPF HIF FLL 270

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 

271 PYIN PDL YLK KFIQ QVY LAI 290 

1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 

291 MWLA MSS TMY NPII YCC LND 310 



1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311 RFRL GFK HAF RCCP FIS AGD 330



1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260
331 YEGL EMK STR YLQT QGS VYK 350 



1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320
351 VSRL ETT IST VVGA HEE EPE 370



1321 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371 DGPK ATP SSL DLTS NCS SRS 390 



1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440
391 DSKT MTE SFS FSSN VLS * 407 



1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500



1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560



1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 



1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 



1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766
Figure 8A

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 



121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
1 MDNVLPVDSD10 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300
11 LSPN IST NTS EPNQ FVQ PAW 30

301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360
31 QIVL WAA AYT VIVV TSV VGN 50

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420
51 VVVM WII LAH KRMR TVT NYF 70

421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480
71 LVNL AFA EAS MAAF NTV VNF 90

481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540
91 TYAV HNE WYY GLFY CKF HNF 110

541 TTCCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600
111 FPIA AVF ASI YSMT AVA FDR 130

601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660
131 YMAI IHP LQP RLSA TAT KVV 150

661 ATCTGTGTCA TCTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720
151 ICVI WVL ALL LAFP QGY YST 170

721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780
171 TETM PSR VVC MIEW PEH PNK 190

781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840
191 IYEK VYH ICV TVLI YFL PLL 210

841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900
211 VIGY AYT VVG ITLW ASE IPG 230

901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960
231 DSSD RYH EQV SAKR KVV KMM 250 
Figure 8B

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 
251 IVVV CTF AIC WLPF HIF FLL 270

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 
271 PYIN PDL YLK KFIQ QVY LAI 290 



1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 
291 MWLA MSS TMY NPII YCC LND 310 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311 RFRL GFK HAF RCCP FIS AGD 330 

1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260
331 YEGL EMK STR YLQT QGS VYK 350 

1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320 
351 VSRL ETT IST VVGA HEE EPE 370



1321 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371 DGPK ATP SSL DLTS NCS SRS 390 



1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS * 407 



1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 



1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 



1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620



1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 
1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766
Figure 9A

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 



121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
1 MDNVLPVDSD10 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300 
11LSPN 1ST NTS EPNQ FVQ PAW 30 



301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360 

31QIVL W A A AYT VIVV TSV VGN 50 

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420 

51 V V V M W I I LAH KRMR TVT NYF 70 

421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 

71LVNL A F A EAS MAAF N T V VNF 90 

481 AC CTATGCTG TCCACAACGA ATGGT AC T AC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 

91TYAV HNE WYY GLFY CKF HNF 110 



541 TTTCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 
111 FPIA A V F ASI YSMT AVA FDR 130 



601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 

131 YMAI I H P LQP R L S A TAT KVV 150 

661 ATCTGTGTCA TATGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720 

151 ICVI W V L ALL LAFP QGY YST 170 

721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780

171 TETM PSR VVC MIEW PEH PNK 190 



781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 
191 IYEK VYH ICV TVLI YFL PLL 210 



841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900
211 VIGY AYT VVG ITLW ASE IPG 230 



901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960 
231 DSSD RYH EQV SAKR KVV KMM 250 
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Figure 9B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020
251 IVVV CTF AIC WLPF HIF F L L 270 



1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080
271PYIN PDL YLK KFIQ QVY LAI 290 



1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 
291 MWLA MSS TMY NPII YCC L N D 310 



1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311 RFRL GFK HAF RCCP FIS AGD 330 



1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260
331 YEGL EMK STR Y L Q T QGS VYK 350 



1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320
351 V S R L ETT IST VVGA HEE EPE 370



1321 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371DGPK ATP SSL DLTS NCS SRS 390 



1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS * 407 



1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 



1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560
1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 



1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 
1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766 
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Figure 10A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 



121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
1 MDNVLPVDSD10 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300
11 L S P N IST NTS EPNQ F V Q PAW 30



301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360
31QIVL W A A AYT VIVV TSV VGN 50 

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420
51 V V V M WII LAH KRMR TVT NYF 70 



421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 

71LVNL A F A EAS MAAF NTV VNF 90 

481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 

91TYAV HNE WYY GLFY CKF HNF 110 



541 TTTCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 
111 FPIA A V F ASI YSMT AVA FDR 130 



601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 

131 YMAI IHP LQP RLSA TAT KVV 150 

661 ATCTGTGTCA TCTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720

151 ICVI WVL ALL LAFP QGY YST 170 



721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780
171 TETM PSR VVC MIEW PEH PNK 190 

781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840
191IYEK VYH ICV TVLI YFL PLL 210 

841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900
211 VIGY AYT VVG ITLW ASE IPG 230 



901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960
231 DSSD RYH EQV SAKR KVV KMM 250 
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Figure 10B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020

251 IVVV CTF AIC WLPF HIF F L L 270 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 

271 PYIN PDL YLK KFIQ QVY LAI 290 



1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140
291 MWLA MSS TMY NPII YCC LND 310



1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311 RFRL GFK HAF RCCP FIS AGD 330 



1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260
331 YEGL EMK STR YLQT QGS VYK 350 



1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320
351 VSRL ETT IST VVGA HEE EPE 370



1321 GACGGCCCCA AGGCCACACC CTCATCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380
371 DGPK ATP SSL DLTS NCS SRS 390 



1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS * 407 



1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 



1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 



1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 



1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 



1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766
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Figure 11A 

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR LTL LTL LLLL LAG DRA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP ESL QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI SKML FVE P I L 60 

241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300 
61EVSS LPT TNS TTNS ATK ITA 80 



301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360
81NTTD EPT TQP TTEP TTQ PTI 100 

361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420 
101 QPTQ PTT QLP TDSP TQP TTG 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
121 SFCP GPV TLC SDLE SHS TEA 140 

481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 
141 VLGD ALV DFS LKLY HAF SAM 160 

541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181 VLLG A G Q NTK TNLE SIL SYP 200 

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720
201 KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 

221 VSQI FHS PDL AIRD TFV N A S 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 

241 RTLY SSS PRV LSNN SDA NLE 260 



841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 



901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281LPSD TRL VLL NAIY LSA KWK 300 
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Figure 11B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020 
301 TTFD PKK TRM EPFH FKN SVI 320 



1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321 KVPM MNS KKY PVAH FID QTL 340 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341 KAKV GQL QLS HNLS LVI LVP 360 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
361QNLK HRL EDM EQAL SPS VFK 380 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260
381 AIME KLE MSK FQPT LLT LPR 400 



1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320
401 IKVT TSQ DML SIME KLE FFD 420 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380
421 FSYD LNL CGL TEDP DLQ VSA 440 



1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440
441 MQHQ TVL ELT ETGV E A A A A S 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500
461 AISV ART LLV FEVQ QPF LFV 480 

1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560
481 LWDQ QHK FPV FMGR VYD PRA 500 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 

1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826
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Figure 12A 

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 



61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR LTL LTL L L L L LAG DRA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP ESL QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI S K M L FVE P I L 60 

241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300
61 E V S S LPT TNS TTNS ATK ITA 80 

301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360 
81NTTD EPT TQP TTEP TTQ PTI 100 



361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420
101 QPTQ PTT QLP TDSP TQP TTG 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
121 SFCP GPV TLC SDLE SHS TEA 140 

481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 
141 VLGD ALV DFS LKLY HAF SAM 160 



541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660

181 VLLG AGQ NTK TNLE SIL SYP 200 

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720

201 KDFT CVH QAL KGFT TKG VTS 220 

721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 

221 VSQI FHS PDL AIRD TFV NAS 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840

241 RTLY SSS PRV LSNN SDA NLE 260 



841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 

261 LINT WVA KNT NNKI SRL LDS 280 

901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960

281 LPSD TRL VLL NAIY LSA KWK 300 
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Figure 12B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020
301 TTFD PKK TRM EPFH FKN SVI 320 



1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321 KVPM MNS KKY PVAH FID QTL 340 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341KAKV GQL QLS HNLS L V I LVP 360 



1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
361 QNLK HRL EDM EQAL SPS VFK 380 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
381 AIME KLE MSK FQPT LLT LPR 400

1261 ATCAAAGTGA CGACCAGTCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 
401 IKVT TSQ DML SIME KLE FFD 420

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 
421 FSYD LNL CGL TEDP DLQ VSA 440 



1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440

441 MQHQ TVL ELT ETGV E A A AAS 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 

461 AISV ART LLV FEVQ QPF LFV 480 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560
481 LWDQ QHK FPV FMGR VYD PRA 500 



1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620
1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 
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Figure 13A 

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 



61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120
1MASR LTL LTL LLLL LAG DRA 20 



121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21S S N P NAT SSS SQDP ESL QDR 40 



181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGCTGA ACCCATCCTG 240 

41GEGK VAT TVI SKML F A E P I L 60 

241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300

61EVSS LPT TNS TTNS ATK ITA 80 



301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360
81NTTD EPT TQP TTEP TTQ PTI 100 

361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420
101 QPTQ PTT QLP TDSP TQP TTG 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
121SFCP GPV TLC SDLE SHS TEA 140 



481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 
141 VLGD A L V DFS LKLY HAF SAM 160 



541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161 KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181VLLG AGQ NTK TNLE SIL SYP 200 



661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720
201 KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780

221 VSQI FHS PDL AIRD TFV NAS 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 

241 RTLY SSS PRV LSNN SDA NLE 260 



841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900
261 LINT WVA KNT NNKI SRL LDS 280 



901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960
281 LPSD TRL VLL NAIY LSA KWK 300 
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Figure 13B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020 
301 TTFD PKK TRM EPFH FKN SVI 320 



1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321 KVPM MNS KKY PVAH FID QTL 340 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341 KAKV GQL QLS HNLS LVI LVP 360 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
361 QNLK HRL EDM EQAL SPS VFK 380 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260
381 AIME KLE MSK FQPT LLT LPR 400 

1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320
401 IKVT TSQ DML SIME KLE FFD 420 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380
421 FSYD LNL CGL TEDP DLQ VSA 440 

1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440 
441 MQHQ TVL ELT ETGV E A A A A S 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 
461 AISV ART LLV FEVQ QPF LFV 480



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560 
481 LWDQ QHK FPV FMGR VYD PRA 500 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620



1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 
1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 
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Figure 14A 

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120
1MASR LTL LTL L L L L LAG DRA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP ESL QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI SKML F V E P I L 60 

241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300
61EVSS LPT TNS TTNS ATK ITA 80 



301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360
81NTTD EPT TQP TTEP TTQ PTI 100 



361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420 

101 QPTQ PTT QLP TDSP TQP TTG 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 

121 SFCP GPV TLC SDLE SHS TEA 140 

481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGGAATG 540

141 VLGD ALV DFS LKLY HAF SGM 160 

541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 

161 KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181VLLG AGQ NTK TNLE SIL SYP 200 

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720
201 KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780
221 VSQI FHS PDL AIRD TFV NAS 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
241 RTLY SSS PRV LSNN SDA NLE 260 



841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 



901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281 LPSD TRL VLL NAIY LSA KWK 300 
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Figure 14B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020
301 TTFD PKK TRM EPFH FKN SVI 320 



1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321 KVPM MNS KKY PVAH FID QTL 340 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341 KAKV GQL QLS HNLS LVI LVP 360 



1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200

361QNLK H R L EDM EQAL SPS VFK 380 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260

381 AIME KLE MSK FQPT LLT LPR 400 

1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320

401 IKVT TSQ DML SIME KLE FFD 420 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 

421 FSYD LNL CGL TEDP DLQ VSA 440 



1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440

441 MQHQ TVL ELT ETGV E A A A A S 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 

461 AISV ART LLV FEVQ QPF LFV 480

1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560

481 LWDQ QHK FPV FMGR VYD PRA 500 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 



1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800



1801 TCTATAAATA AAACCTGACA GACCAT 1826
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Figure 15A

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120
1 MASR LTL LTL LLLL LAG DRA 20

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180
21 SSNP NAT SSS SQDP ESL QDR 40

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240
41 GEGK VAT TVI SKML FVE PIL 60

241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300
61 EVSS LPT TNS TTNS ATK ITA 80

301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360
81 NTTD EPT TQP TTEP TTQ PTI 100

361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420
101 QPTQ PTT QLP TDSP TQP TTG 120

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480
121 SFCP GPV TLC SDLE SHS TEA 140

481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540
141 VLGD ALV DFS LKLY HAF SAM 160

541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600
161 KKVE TNM AFS PFSI ASL LTQ 180

601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660
181 VLLG AGQ NTK TNLE SIL SYP 200

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720
201 KDFT CVH QAL KGFT TKG VTS 220

721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780
221 VSQI FHS PDL AIRD TFV NAS 240

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840
241 RTLY SSS PRV LSNN SDA NLE 260

841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900
261 LINT WVA KNT NNKI SRL LDS 280

901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960
281 LPSD TRL VLL NAIY LSA KWK 300
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Figure 15B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020
301 TTFD PKK TRM EPFH FKN SVI 320 



1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321 KVPM MNS KKY PVAH FID QTL 340 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341 KAKV GQL QLS HNLS L V I LVP 360 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
361 QNLK H R L EDM EQAL SPS VFK 380

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260
381 AIME KLE MSK FQPT LLT LPR 400



1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320
401 IKVT TSQ DML SIME KLE FFD 420



1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380
421 FSYD LNL CGL TEDP DLQ VSA 440 



1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440 

441 MQHQ TVL ELT ETGV E A A AAS 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCATG 1500 

461 AISV ART LLV FEVQ QPF LFM 480 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560 
481 LWDQ QHK FPV FMGR VYD PRA 500

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620



1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 
1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 
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Figure 16 

1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
1 MWFLVLCL8 



61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120
9ALSL GGT G A A PPIQ SRI VGG 28 



121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 
29WECE QHS QPW QAAL YHF STF 48 



181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 
49QCGG I L V HRQ WVLT A A H CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300
69DNYQ LWL GRH NLFD DEN TAQ 88 

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360
89FVHV SES FPH PGFN MSL LEN 108 



361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 
109 HTRQ ADE DYS HDLM LLR LTE 128 



421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCGA GGAACCCGAA 480 

129 PADT ITD A V K VVEL PTE EPE 148 

481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 

149 VGST C L A SGW GSIE PEN FSF 168



541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CAAAAAAGCC 600 
169 PDDL QCV D L K ILPN DEC KKA 188 



601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189 HVQK VTD FML CVGH LEG GKD 208 



661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720
209 TCVG DSG GPL MCDG VLQ GVT 228 



721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780
229 SWGY VPC GTP NKPS VAV RVL 248 

781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS * 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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Figure 17

1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60
1 MW FLV LCL 8

61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120
9 ALSL GGT GAA PPIQ SRI VGG 28

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180
29 WECE QHS QPW QAAL YHF STF 48

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240
49 QCGG ILV HRQ WVLT AAH CIS 68

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300
69 DNYQ LWL GRH NLFD DEN TAQ 88

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360
89 FVHV SES FPH PGFN MSL LEN 108

361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420
109 HTRQ ADE DYS HDLM LLR LTE 128

421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCGA GGAACCCGAA 480
129 PADT ITD AVK VVEL PTE EPE 148

481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540
149 VGST CLA SGW GSIE PEN FSF 168

541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CGAAAAAGCC 600
169 PDDL QCV DLK ILPN DEC EKA 188

601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660
189 HVQK VTD FML CVGH LEG GKD 208

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720
209 TCVG DSG GPL MCDG VLQ GVT 228

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780
229 SWGY VPC GTP NKPS VAV RVL 248

781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840
249 SYVK WIE DTI AENS * 262



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
1 MWFLVLCL8 



61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120 
9ALSL GGT G A A PPIQ SRI VGG 28 

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 
29WECE QHS QPW QAAL YHF STF 48 

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 
49QCGG I L V HRQ WVLT A A H CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300
69DNYQ LWL GRH NLFD DEN TAQ 88 

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360
89FVHV SES FPH PGFN MSL L E N 108 

361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 
109 HTRQ ADE DYS HDLM L L R LTE 128 

421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCCA GGAACCCGAA 480 
129 PADT ITD A V K VVEL PTQ EPE 148 

481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 
149 VGST C L A SGW GSIE PEN FSF 168

541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CAAAAAAGCC 600 
169 PDDL QCV DLK ILPN DEC KKA 188 

601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189 HVQK VTD FML CVGH LEG GKD 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720
209 TCVG DSG GPL MCDG VLQ GVT 228 

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780
229 SWGY VPC GTP NKPS VAV RVL 248 

781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840
249 SYVK WIE DTI AENS * 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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Figure 19 
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Figure 20 

BRB2_MOUSE MPCSwSlLGF||sKhE . PMPTAaSfG] 

BRB2_RAT MDTRSSLCP . gTQAVgAfflFW . GPGCHLgTC: 
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Figure 21A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60
1 MASSWPPL ELQS SNQ SQL 18

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120
19 FPQN ATA CDN APEA WDL LHR 38

121 GTGCTGCCNA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180
39 VLPT FII SIC FFGL LGN LFV 58

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240
59 LLVF LLP RRQ LNVA EIY LAN 78

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAANGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GNCAGGCCCG GGTCACCTGC 480
139 LVHP MAS GRQ QRRR QAR VTC 158

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540
159 VLIW VVG GLL SIPT FLL RSI 178

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCNTGC TCCTCCCCCA TGAGGCCTGG 600
179 QAVP DLN ITA CIXL LPH EAW 198

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660
199 HFAR IVE LNI LGFL LPL AAI 218

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGANGAGGT CAGCAGGACA 720
219 VFFN YHI LAS LRTR EEV SRT 238

721 AGAGTGCNGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780
239 RVXG PKD SKT TALI LTL VVA 258

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840
259 FLVC WAP YHF FAFL EFL FQV 278

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900
279 QAVR GCF WED FIDL GLQ LAN 298
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Figure 21B



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCNGCTC 960 
299 F F A F TNS SLN PVIY VFV GXL 318



961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 

319 FRTK VWE LYK QCTP KSL API 338 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 

339 SSSH RKE IFQ LFWR N * 353 



1081 CC 1082 






Figure 22A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 



121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
1 MDNVLPVDSD10 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300 
11LSPN IST NTS EPNQ FVQ PAW 30



301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360 
31QIVL W A A AYT VIVV TSV VGN 50 



361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420 
51 V V V M WII LAH KRMR TVT NYF 70



421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
71LVNL AFA EAS MAAF NTV VNF 90



481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 
91TYAV H N E WYY G L F Y CKF HNF 110 



541 TTNCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 
111 FPIA A V F ASI YSMT AVA FDR 130 



601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 
131 YMAI IHP LQP RLSA TAT KVV 150 



661 ATCTGTGTCA TNTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720 
151 ICVI W V L ALL LAFP QGY YST 170 



721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780 
171 TETM PSR VVC MIEW PEH PNK 190 



781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 
191IYEK VYH ICV TVLI YFL PLL 210 



841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900 
211 VIGY AYT VVG ITLW ASE IPG 230 



901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960 
231DSSD RYH EQV SAKR KVV KMM 250 
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Figure 22B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 
251 IVVV CTF AIC WLPF HIF F L L 270 



1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 
271 PYIN PDL Y L K KFIQ QVY LAI 290 



1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140
291 MWLA MSS TMY NPII YCC L N D 310 



1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200
311 RFRL GFK HAF RCCP FIS AGD 330 



1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260
331 YEGL EMK STR YLQT QGS VYK 350 



1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320
351 VSRL ETT IST VVGA HEE EPE 370



1321 GACGGCCCCA AGGCCACACC CTCNTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371 DGPK ATP SSL DLTS NCS SRS 390 



1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS * 407 



1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 



1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 



1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 



1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 



1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766

Figure 23A

1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 



61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR LTL LTL LLLL LAG DRA 20 



121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 

21SSNP NAT SSS SQDP ESL QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGNTGA ACCCATCCTG 240 

41GEGK VAT TVI SKML FXE P I L 60 



241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300
61EVSS LPT TNS TTNS ATK I T A 80 



301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360
81NTTD EPT TQP TTEP TTQ PTI 100 



361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420 

101 QPTQ PTT QLP TDSP TQP TTG 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 

121 SFCP GPV TLC SDLE SHS TEA 140 



481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGNAATG 540 
141 VLGD A L V DFS LKLY HAF SXM 160 



541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660

181 VLLG AGQ NTK TNLE SIL SYP 200

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720

201 KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780

221 VSQI FHS PDL AIRD TFV NAS 240

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 

241 RTLY SSS PRV LSNN SDA NLE 260 



841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 



901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960
281 LPSD TRL VLL NAIY LSA KWK 300

Figure 23B

961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020 

301 TTFD PKK TRM EPFH FKN SVI 320 

1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 

321 KVPM MNS KKY PVAH FID QTL 340 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 

341 KAKV GQL QLS HNLS L V I LVP 360 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 

361 QNLK HRL EDM EQAL SPS VFK 380 



1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 

381 AIME KLE MSK FQPT LLT LPR 400 

1261 ATCAAAGTGA CGACCAGNCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 

401 IKVT TSQ DML SIME KLE FFD 420 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 

421 FSYD LNL CGL TEDP DLQ VSA 440 

1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440

441 MQHQ TVL ELT ETGV E A A A A S 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCNTG 1500 

461AISV ART LLV FEVQ QPF LFX 480 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560 
481 LWDQ QHK FPV FMGR VYD PRA 500 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620



1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826

Figure 24

1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
1 MWFLVLCL8 



61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120
9 A L S L GGT G A A PPIQ SRI VGG 28 



121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 
29WECE QHS QPW QAAL YHF STF 48 

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 
49QCGG I L V HRQ WVLT A A H CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300
69 DNYQ LWL GRH NLFD DEN TAQ 88

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360
89FVHV SES FPH PGFN MSL LEN 108 



361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 

109HTRQ ADE DYS HDLM LLR LTE 128 

421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCNA GGAACCCGAA 480 

129 PADT ITD A V K VVEL PTX EPE 148 



481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 

149 VGST C L A SGW GSIE PEN FSF 168 

541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CNAAAAAGCC 600 

169 PDDL QCV DLK ILPN DEC XKA 188 



601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189 HVQK VTD FML CVGH LEG GKD 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720 
209 TCVG DSG GPL MCDG VLQ GVT 228 

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780
229 SWGY VPC GTP NKPS VAV RVL 248 



781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS * 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 



D0053 NP 



Figure 25A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60
1 MASSWPPL ELQS SNQ SQL 18

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120
19 FPQN ATA CDN APEA WDL LHR 38

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180
39 VLPT FII SIC FFGL LGN LFV 58

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240
59 LLVF LLP RRQ LNVA EIY LAN 78

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAATGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480
139 LVHP MAS GRQ QRRR QAR VTC 158

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540
159 VLIW VVG GLL SIPT FLL RSI 178

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600
179 QAVP DLN ITA CILL LPH EAW 198

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660
199 HFAR IVE LNI LGFL LPL AAI 218

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720
219 VFFN YHI LAS LRTR EEV SRT 238

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780
239 RVRG PKD SKT TALI LTL VVA 258

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840
259 FLVC WAP YHF FAFL EFL FQV 278

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900
279 QAVR GCF WED FIDL GLQ LAN 298

Figure 25B



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE I F Q LFWR N* 353 



1081 CC 1082

Figure 26A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 

1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120

19 F P Q N ATA CDN APEA WDL LHR 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 

39 VLPT FII SIC FFGL LGN LFV 58



181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 

59LLVF LLP RRQ LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300

79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360

99NQFN WPF GAL LCRV ING V I K 118 

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 

119 ANLF ISI FLV VAIS QDR YRV 138 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GACAGGCCCG GGTCACCTGC 480 

139 LVHP MAS GRQ QRRR QAR VTC 158 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 

159 VLIW VVG GLL SIPT FLL RSI 178 



541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 

179 QAVP DLN I T A CILL LPH EAW 198 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 

199 HFAR IVE L N I LGFL LPL A A I 218 



661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219 VFFN YHI LAS LRTR EEV SRT 238 



721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 RVRG PKD SKT TALI LTL VVA 258 



781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 



841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 QAVR GCF WED FIDL GLQ LAN 298

Figure 26B



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080
339 SSSH RKE IFQ LFWR N * 353 



1081 CC 1082

Figure 27A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 

1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 

19FPQN ATA CDN APEA WDL LHR 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 

39 VLPT FII SIC FFGL LGN LFV 58



181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138 



421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139 LVHP MAS GRQ QRRR QAR VTC 158 



481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 

159 VLIW VVG GLL SIPT FLL RSI 178 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCGTGC TCCTCCCCCA TGAGGCCTGG 600 

179 QAVP DLN ITA CIVL LPH EAW 198 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 

199 HFAR IVE LNI LGFL LPL A A I 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 

219 VFFN YHI LAS LRTR EEV SRT 238 



721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780
239 RVRG PKD SKT TALI LTL VVA 258 



781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 



841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 QAVR GCF WED FIDL GLQ LAN 298

Figure 27B



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE IFQ LFWR N * 353 



1081 CC 1082

Figure 28A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60
1 MASSWPPL ELQS SNQ SQL 18

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120
19 FPQN ATA CDN APEA WDL LHR 38

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180
39 VLPT FII SIC FFGL LGN LFV 58

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240
59 LLVF LLP RRQ LNVA EIY LAN 78

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480
139 LVHP MAS GRQ QRRR QAR VTC 158

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540
159 VLIW VVG GLL SIPT FLL RSI 178

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600
179 QAVP DLN ITA CILL LPH EAW 198

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660
199 HFAR IVE LNI LGFL LPL AAI 218

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720
219 VFFN YHI LAS LRTR KEV SRT 238

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780
239 RVRG PKD SKT TALI LTL VVA 258

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840
259 FLVC WAP YHF FAFL EFL FQV 278

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900
279 QAVR GCF WED FIDL GLQ LAN 298

Figure 28B



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 



961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020
319 FRTK VWE LYK QCTP KSL API 338 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSHRKEIFQLFWRN* 353 



1081 CC 1082

Figure 29A

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTT GTGAGGACTC CGTGCCCACC 60 

1MFSP WKI SMF LSVC EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120 

21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 

41TFAQ SKC PQV EWLG WLN TIQ 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 

61PPFL WVL FVL ATLE NIF VLS 80 



241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
81VFCL HKS SCT VAEI YLG NLA 100 



301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360

101 AADL ILA CGL PFWA ITI SNN 120

361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420

121 FDWL FGE TLC RVVN AII SMN 140

421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480

141 LYSS ICF LML VSID RYL ALV 160

481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 

161 KTMS MGR MRG VRWA KLY SLV 180 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 

181 IWGC TLL LSS PMLV FRT MKE 200 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 

201 YSDE GHN VTA CVIS YPS LIW 220 



661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
221 EVFT NML LNV VGFL LPL SVI 240 



721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780
241 TFCT MQI MQV LRNN EMQ KFK 260

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840

261 EIQT ERR A T V LVLV VLL LFI 280 

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 

281 ICWL PFQ IST FLDT LHR LGI 300

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960

301LSSC QDE RII DVIT Q I A SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020

321 AYSN SCL NPL VYVI VGK RFR 340

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080

341 KKSW EVY QGV CQKG GCR SEP 360

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140

361 IQME NSM GTL RTSI SVE RQI 380

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200

381 HKLQ DWA GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800

1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040

Figure 29C

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240

Figure 29D

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540
3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720
3721 AAAAAAAAAA AAA 3733

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60
1MFSP WKI SMF LSVR EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120
21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
41TFAQ SKC PQV EWLG WLN T I Q 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61 PPFL WVL FVL ATLE NIF VLS 80

241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300
81 VFCL HKS SCT VAEI YLG NLA 100

301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360
101 AADL ILA CGL PFWA ITI SNN 120

361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420
121 FDWL FGE TLC RVVN AII SMN 140

421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141LYSS ICF LML VSID RYL A L V 160 

481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
161 KTMS MGR MRG VRWA KLY SLV 180 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181 IWGC TLL LSS PMLV FRT MKE 200 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201 YSDE GHN VTA CVIS YPS LIW 220 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
221 EVFT NML LNV VGFL LPL SVI 240 

721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780
241 TFCT MQI MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
261 EIQT ERR A T V LVLV VLL LFI 280 



841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ IST FLDT LHR LGI 300

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GACGTAATCA CACAGATCGC CTCCTTCATG 960
301 LSSC QDE RII DVIT Q I A SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321 AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080
341 KKSW EVY QGV CQKG GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361 IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q* 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 

1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 



Figure 30C 

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240

Figure 30D

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540
3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720
3721 AAAAAAAAAA AAA 3733

Figure 31A

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60
1 MFSP WKI SMF LSVR EDS VPT 20

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120
21 TASF SAD MLN VTLQ GPT LNG 40

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180
41 TFAQ SKC PQV EWLG WLN TIQ 60

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240
61 PPFL WVL FVL ATLE NIF VLS 80

241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300
81 VFCL HKS SCT VAEI YLG NLA 100

301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360
101 AADL ILA CGL PFWA ITI SNN 120

361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420
121 FDWL FGE TLC RVVN AII SMN 140

421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480
141 LYSS ICF LML VSID RYL ALV 160

481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540
161 KTMS MGR MRG VRWA KLY SLV 180

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600
181 IWGC TLL LSS PMLV FRT MKE 200

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660
201 YSDE GHN VTA CVIS YPS LIW 220

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720
221 EVFT NML LNV VGFL LPL SVI 240

721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780
241 TFCT MQI MQV LRNN EMQ KFK 260

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840
261 EIQT ERR ATV LVLV VLL LFI 280

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900
281 ICWL PFQ IST FLDT LHR LGI 300

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960
301 LSSC QDE RII DVIT Q I A SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020
321 AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG AGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKE GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361 IQME NSM GTL RTSI SVE R Q I 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800

1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040



Figure 31C 

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240

Figure 31D

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540
3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720
3721 AAAAAAAAAA AAA 3733
Figure 32A



1 CGCCCAACCCAAGTTCAAAGGCTGATAAGAGAGAAAATCTCATGAGGAGGTTTTAGTCTA 60 



61 GGGAAAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGATGTCAAGCTCTTCCTG 120
1 MSSSSW 6



121 GCTCCTTCTCAGCCTTGTTGCTGTAACTGCTGCTCAGTCCACCATTGAGGAACAGGCCAA 180

7 LLLSLVAVTAAQSTIEEQAK 26 

181 GACATTTTTGGACAAGTTTAACCACGAAGCCGAAGACCTGTTCTATCAAAGTTCACTTGC 240 

27 TFLDKFNHEAEDLFYQSSLA 46 



241 TTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATGAATAATGCTGG 300

47 SWNYNTNITEENVQNMNNAG 66 

301 GGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACA 360

67 DKWSAFLKEQSTLAQMYPLQ 86 

361 AGAAATTCAGAATCTCACAGTCAAGCTTCAGCTGCAGGCTCTTCAGCAAAATGGGTCTTC 420

87 EIQNLTVKLQLQALQQNGSS 106

421 AGTGCTCTCAGAAGACAAGAGCAAACGGTTGAACACAATTCTAAATACAATGAGCACCAT 480 

107 VLSEDKSKRLNTILNTMSTI 126 



481 CTACAGTACTGGAAAAGTTTGTAACCCAGATAATCCACAAGAATGCTTATTACTTGAACC 540 

127 YSTGKVCNPDNPQECLLLEP 146 

541 AGGTTTGAATGAAATAATGGCAAACAGTTTAGACTACAATGAGAGGCTCTGGGCTTGGGA 600

147 GLNEIMANSLDYNERLWAWE 166 



601 AAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTT 660 

167 SWRSEVGKQLRPLYEEYVVL 186

661 GAAAAATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGA 720

187 KNEMARANHYEDYGDYWRGD 206 



721 CTATGAAGTAAATGGGGTAGATGGCTATGACTACAGCCGCGGCCAGTTGATTGAAGATGT 780 
207 YEVNGVDGYDYSRGQLIEDV 226



781 GGAACATACCTTTGAAGAGATTAAACCATTATATGAACATCTTCATGCCTATGTGAGGGC 840 
227 EHTFEEIKPLYEHLHAYVRA 246 



841 AAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGATGCCTCCCTGCTCATTT 900
247 KLMNAYPSYISPIGCLPAHL 266
Figure 32B

901 GCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCTTTGACAGTTCCCTTTGG 960
267 LGDMWGRFWTNLYSLTVPFG 286



961 ACAGAAACCAAACATAGATGTTACTGATGCAATGGTGGACCAGGCCTGGGATGCACAGAG 1020
287 QKPNIDVTDAMVDQAWDAQR 306 



1021 AATATTCAAGGAGGCCGAGAAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGG 1080 
307 IFKEAEKFFVSVGLPNMTQG 326 



1081 ATTCTGGGAAAATTCCATGCTAACGGACCCAGGAAATGTTCAGAAAGCAGTCTGCCATCC 1140

327 FWENSMLTDPGNVQKAVCHP 346 

1141 CACAGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATGTGCACAAAGGTGACAAT 1200

347 TAWDLGKGDFRILMCTKVTM 366



1201 GGACGACTTCCTGACAGCTCATCATGAGATGGGGCATATCCAGTATGATATGGCATATGC 1260

367 DDFLTAHHEMGHIQYDMAYA 386

1261 TGCACAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGA 1320
387 AQPFLLRNGANEGFHEAVGE 406 

1321 AATCATGTCACTTTCTGCAGCCACACCTAAGCATTTAAAATCCATTGGTCTTCTGTCACC 1380

407 IMSLSAATPKHLKSIGLLSP 426 

1381 CGATTTTCAAGAAGACAATGAAACAGAAATAAACTTCCTGCTCAAACAAGCACTCACGAT 1440
427 DFQEDNETEINFLLKQALTI 446 



1441 TGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGGTGGATGGTCTTTAAAGG 1500

447 VGTLPFTYMLEKWRWMVFKG 466 

1501 GGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGG 1560

467 EIPKDQWMKKWWEMKREIVG 486 



1561 GGTGGTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGT 1620
487 VVEPVPHDETYCDPASLFHV 506 



1621 TTCTAATGATTACTCATTCATTCGATATTACACAAGGACCCTTTACCAATTCCAGTTTCA 1680 
507 SNDYSFIRYYTRTLYQFQFQ 526 



1681 AGAAGCACTTTGTCAAGCAGCTAAACATGAAGGCCCTCTGCACAAATGTGACATCTCAAA 1740
527 EALCQAAKHEGPLHKCDISN 546
Figure 32C



1741 CTCTACAGAAGCTGGACAGAAACTGTTCAATATGCTGAGGCTTGGAAAATCAGAACCCTG 1800 
547 STEAGQKLFNMLRLGKSEPW 566 



1801 GACCCTAGCATTGGAAAATGTTGTAGGAGCAAAGAACATGAATGTAAGGCCACTGCTCAA 1860

567 TLALENVVGAKNMNVRPLLN 586 

1861 CTACTTTGAGCCCTTATTTACCTGGCTGAAAGACCAGAACAAGAATTCTTTTGTGGGATG 1920

587 YFEPLFTWLKDQNKNSFVGW 606 

1921 GAGTACCGACTGGAGTCCATATGCAGACCAAAGCATCAAAGTGAGGATAAGCCTAAAATC 1980

607 STDWSPYADQSIKVRISLKS 626 

1981 AGCTCTTGGAGATAAAGCATATGAATGGAACGACAATGAAATGTACCTGTTCCGATCATC 2040

627 ALGDKAYEWNDNEMYLFRSS 646 



2041 TGTTGCATATGCTATGAGGCAGTACTTTTTAAAAGTAAAAAATCAGATGATTCTTTTTGG 2100 
647 VAYAMRQYFLKVKNQMILFG 666 



2101 GGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAAGAATCTCCTTTAATTTCTTTGTCAC 2160 

667 EEDVRVANLKPRISFNFFVT 686

2161 TGCACCTAAAAATGTGTCTGATATCATTCCTAGAACTGAAGTTGAAAAGGCCATCAGGAT 2220

687 APKNVSDIIPRTEVEKAIRM 706 



2221 GTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGG 2280
707 SRSRINDAFRLNDNSLEFLG 726 



2281 GATACAGCCAACACTTGGACCTCCTAACCAGCCCCCTGTTTCCATATGGCTGATTGTTTT 2340
727 IQPTLGPPNQPPVSIWLIVF 746 



2341 TGGAGTTGTGATGGGAGTGATAGTGGTTGGCATTGTCATCCTGATCTTCACTGGGATCAG 2400
747 GVVMGVIVVGIVILIFTGIR 766 



2401 AGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAGAAAATCCTTATGCCTCCATCGATAT 2460 
767 DRKKKNKARSGENPYASIDI 786



2461 TAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAGACCTCCTTTTA 2520
787 SKGENNPGFQNTDDVQTSF* 806 



2521 GAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGTTGTATGTAAATGTTAATTTCATGG 2580



2581 TATAGAAAATATAAGATGATAAAGATATCATTAAATGTCAAAACTATGACTCTGTTCAGA 2640



Figure 32D 

2641 AAAAAAATTGTCCAAAGACAACATGGCCAAGGAGAGAGCATCTTCATTGACATTGCTTTC 2700
2701 AGTATTTATTTCTGTCTCTGGATTTGACTTCTGTTCTGTTTCTTAATAAGGATTTTGTAT 2760
2761 TAGAGTATATTAGGGAAAGTGTGTATTTGGTCTCACAGGCTGTTCAGGGATAATCTAAAT 2820
2821 GTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGATATATCATTGGAGCAAGTGTTG 2880
2881 GATCTTGTATGGAATATGGATGGATCACTTGTAAGGACAGTGCCTGGGAACTGGTGTAGC 2940
2941 TGCAAGGATTGAGAATGGCATGCATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGA 3000
3001 CATGCTTTCTTCACAGTAACTCAGTTCAAGTACTATGGTGATTTGCCTACAGTGATGTTT 3060
3061 GGAATCGATCATGCTTTCTTCAAGGTGACAGGTCTAAAGAGAGAAGAATCCAGGGAACAG 3120
3121 GTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAACATCTCCCTGACAACACAA 3180
3181 AACTAGAGCCAGGGGCCTCCGTGAACTCCCAGAGCATGCCTGATAGAAACTCATTTCTAC 3240
3241 TGTTCTCTAACTGTGGAGTGAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGT 3300
3301 ACCCAGTCTCTTAAATCTTTTGTATTTGCTCACAGTGTTTGAGCAGTGCTGAGCACAAAG 3360
3361 CAGACACTCAATAAATGCTAGATTTACACACTCAAAAAAAAAAAA 3405
Figure 33A



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60
1 MHLIDYLLLLLVGLLALSHG 20



61 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180

41 ETGEGSPSLKIAPANADFAF 60 



181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 
61RFYYLIASETPGKNIFFSPL 80 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300
81 SISAAYAMLSLGACSHSRSQ 100



301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360
101 ILEGLGFNLTELSESDVHRG 120 



361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420

121 FQHLLHTLNLPGHGLETRVG 140 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 
161 AVYEAKLFHTNFYDTVGTIQ 180 



541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 
181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660
201 LKKDVLMVLVNYIYFKALWE 220 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 720
221 KPFISSRTTPKDFYVDENTT 240



721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780

241 VRVPMMLQDQEHHWYLHDRY 260 

781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 

261 LPCSVLRMDYKGDATVFFIL 280
Figure 33B

841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900

281 PNQGKMREIEEVLTPEMLMR 300

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960

301 WNNLLRKRNFYKKLELHLPK 320 

961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020 

321 FSISGSYVLDQILPRLGFTD 340 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 
341 LFSKWADLSGITKQQKLEAS 360 



1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 
361 KSFHKATLDVDEAGTEAAAA 380 



1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200
381 TTFAIKFFSAQTNRHILRFN 400 



1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260
401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284
421 VVDPTKP* 428 
Figure 34A



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60
1 MHLIDYLLLLLVGLLALSHG 20



61 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41 ETGEGSPSLKIAPANADFAF 60

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61RFYYLIASETPGKNIFFSPL 80 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300 
81 SISAAYAMLSLGACSHSRSQ 100



301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360 

101 ILEGLGFNLTELSESDVHRG 120 

361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420

121 FQHLLHTLNLPGHGLETRVG 140

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 
161 AVYEAKLFHTNFYDTVGTIQ 180 



541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600
181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 

201 LKKDVLMVLVNYIYFKALWE 220

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTTTATGTTGATGAGAACACAACA 720

221 KPFISSRTTPKDFYVDENTT 240 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780

241 VRVPMMLQDQEHHWYLHDRY 260 

781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 

261 LPCSVLRMDYKGDATVFFIL 280 
Figure 34B

841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960

301 WNNLLRKRNFYKKLELHLPK 320 



961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020

321 FSISGSYVLDQILPRLGFTD 340

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 



1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200

381 TTFAIKFFSAQTNRHILRFN 400 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260

401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284 
421 VVDPTKP* 428 
Figure 35A



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60
1 MHLIDYLLLLLVGLLALSHG 20



61 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120
21 QLHVEHDGESCSNSSHQQIL 40



121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 
41 ETGEGSPSLKIAPANADFAF 60 



181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 
61RFYYLIASETPGKNIFFSPL 80 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300 
81 SISAAYAMLSLGACSHSRSQ 100



301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360 
101 ILEGLGFNLTELSESDVHRG 120 



361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420
121 FQHLLHTLNLPGHGLETRVG 140 



421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 
141 SALFLSHNLKFLAKFLNDTM 160 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

161 AVYEAKLFHTNFYDTVGTIQ 180 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGCGAG 600 

181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660
201 LKKDVLMVLVNYIYFKALWE 220 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 720

221 KPFISSRTTPKDFYVDENTT 240

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780

241 VRVPMMLQDQEHHWYLHDRY 260 

781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 

261 LPCSVLRMDYKGDATVFFIL 280 
Figure 35B



841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960

301 WNNLLRKRNFYKKLELHLPK 320 

961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020

321 FSISGSYVLDQILPRLGFTD 340 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 
341 LFSKWADLSGITKQQKLEAS 360



1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 
361 KSFHKATLDVDEAGTEAAAA 380 



1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200

381 TTFAIKFFSAQTNRHILRFN 400 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260

401 RPFLVVIFSTSTQSVLFLGK 420



1261 GTCGTCGACCCCACGAAACCATAG 1284
421 VVDPTKP* 428 
Figure 36A

1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60

1MHLIDYLLLLLVGLLALSHG 20 

61 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41 ETGEGSPSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61RFYYLIASETPGKNIFFSPL 80 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300
81 SISAAYAMLSLGACSHSRSQ 100



301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360
101 ILEGLGFNLTELSESDVHRG 120 



361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420 

121 FQHLLHTLNLPGHGLETRVG 140 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 

481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

161 AVYEAKLFHTNFYDTVGTIQ 180 



541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGNGAG 600 
181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 

201 LKKDVLMVLVNYIYFKALWE 220 

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTNTATGTTGATGAGAACACAACA 720 

221 KPFISSRTTPKDFYVDENTT 240 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780

241 VRVPMMLQDQEHHWYLHDRY 260 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840
261 LPCSVLRMDYKGDATVFFIL 280 
Figure 36B

841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960

301 WNNLLRKRNFYKKLELHLPK 320 

961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020

321 FSISGSYVLDQILPRLGFTD 340 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 



1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200 

381 TTFAIKFFSAQTNRHILRFN 400

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260

401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284
421 VVDPTKP* 428
Figure 37A



1 CGCCCAACCCAAGTTCAAAGGCTGATAAGAGAGAAAATCTCATGAGGAGGTTTTAGTCTA 60 



61 GGGAAAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGATGTCAAGCTCTTCCTG 120
1 MSSSSW 6



121 GCTCCTTCTCAGCCTTGTTGCTGTAACTGCTGCTCAGTCCACCATTGAGGAACAGGCCAA 180 

7 LLLSLVAVTAAQSTIEEQAK 26 

181 GACATTTTTGGACAAGTTTAACCACGAAGCCGAAGACCTGTTCTATCAAAGTTCACTTGC 240 

27 TFLDKFNHEAEDLFYQSSLA 46 



241 TTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATGAATAATGCTGG 300 

47 SWNYNTNITEENVQNMNNAG 66

301 GGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACA 360

67 DKWSAFLKEQSTLAQMYPLQ 86 

361 AGAAATTCAGAATCTCACAGTCAAGCTTCAGCTGCAGGCTCTTCAGCAAAATGGGTCTTC 420

87 EIQNLTVKLQLQALQQNGSS 106 

421 AGTGCTCTCAGAAGACAAGAGCAAACGGTTGAACACAATTCTAAATACAATGAGCACCAT 480 

107 VLSEDKSKRLNTILNTMSTI 126 



481 CTACAGTACTGGAAAAGTTTGTAACCCAGATAATCCACAAGAATGCTTATTACTTGAACC 540 

127 YSTGKVCNPDNPQECLLLEP 146 

541 AGGTTTGAATGAAATAATGGCAAACAGTTTAGACTACAATGAGAGGCTCTGGGCTTGGGA 600 

147 GLNEIMANSLDYNERLWAWE 166 



601 AAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTT 660 
167 SWRSEVGKQLRPLYEEYVVL 186 



661 GAAAAATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGA 720 
187 KNEMARANHYEDYGDYWRGD 206 



721 CTATGAAGTAAATGGGGTAGATGGCTATGACTACAGCCGCGGCCAGTTGATTGAAGATGT 780
207 YEVNGVDGYDYSRGQLIEDV 226 



781 GGAACATACCTTTGAAGAGATTAAACCATTATATGAACATCTTCATGCCTATGTGAGGGC 840

227 EHTFEEIKPLYEHLHAYVRA 246 

841 AAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGATGCCTCCCTGCTCATTT 900

247 KLMNAYPSYISPIGCLPAHL 266 
Figure 37B

901 GCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCTTTGACAGTTCCCTTTGG 960

267 LGDMWGRFWTNLYSLTVPFG 286 

961 ACAGAAACCAAACATAGATGTTACTGATGCAATGGTGGACCAGGCCTGGGATGCACAGAG 1020

287 QKPNIDVTDAMVDQAWDAQR 306 



1021 AATATTCAAGGAGGCCGAGAAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGG 1080
307 IFKEAEKFFVSVGLPNMTQG 326 



1081 ATTCTGGGAAAATTCCATGCTAACGGACCCAGGAAATGTTCAGAAAGCAGTCTGCCATCC 1140 

327 FWENSMLTDPGNVQKAVCHP 346 

1141 CACAGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATGTGCACAAAGGTGACAAT 1200 

347 TAWDLGKGDFRILMCTKVTM 366



1201 GGACGACTTCCTGACAGCTCATCATGAGATGGGGCATATCCAGTATGATATGGCATATGC 1260
367 DDFLTAHHEMGHIQYDMAYA 386 



1261 TGCACAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGA 1320

387 AQPFLLRNGANEGFHEAVGE 406 

1321 AATCATGTCACTTTCTGCAGCCACACCTAAGCATTTAAAATCCATTGGTCTTCTGTCACC 1380 

407 IMSLSAATPKHLKSIGLLSP 426 



1381 CGATTTTCAAGAAGACAATGAAACAGAAATAAACTTCCTGCTCAAACAAGCACTCACGAT 1440

427 DFQEDNETEINFLLKQALTI 446 

1441 TGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGGTGGATGGTCTTTAAAGG 1500 

447 VGTLPFTYMLEKWRWMVFKG 466 



1501 GGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGG 1560 

467 EIPKDQWMKKWWEMKREIVG 486 

1561 GGTGGTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGT 1620

487 VVEPVPHDETYCDPASLFHV 506 



1621 TTCTAATGATTACTCATTCATTCGATATTACACAAGGACCCTTTACCAATTCCAGTTTCA 1680
507 SNDYSFIRYYTRTLYQFQFQ 526 



1681 AGAAGCACTTTGTCAAGCAGCTAAACATGAAGGCCCTCTGCACAAATGTGACATCTCAAA 1740 
527 EALCQAAKHEGPLHKCDISN 546 
Figure 37C



1741 CTCTACAGAAGCTGGACAGAAACTGTTCAATATGCTGAGGCTTGGAAAATCAGAACCCTG 1800

547 STEAGQKLFNMLRLGKSEPW 566 

1801 GACCCTAGCATTGGAAAATGTTGTAGGAGCAAAGAACATGAATGTAAGGCCACTGCTCAA 1860

567 TLALENVVGAKNMNVRPLLN 586 

1861 CTACTTTGAGCCCTTATTTACCTGGCTGAAAGACCAGAACAAGAATTCTTTTGTGGGATG 1920
587 YFEPLFTWLKDQNKNSFVGW 606 

1921 GAGTACCGACTGGAGTCCATATGCAGACCAAAGCATCAAAGTGAGGATAAGCCTAAAATC 1980

607 STDWSPYADQSIKVRISLKS 626 

1981 AGCTCTTGGAGATAAAGCATATGAATGGAACGACAATGAAATGTACCTGTTCCGATCATC 2040
627 ALGDKAYEWNDNEMYLFRSS 646 

2041 TGTTGCATATGCTATGAGGCAGTACTTTTTAAAAGTAAAAAATCAGATGATTCTTTTTGG 2100

647 VAYAMRQYFLKVKNQMILFG 666 

2101 GGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAAGAATCTCCTTTAATTTCTTTGTCAC 2160 

667 EEDVRVANLKPRISFNFFVT 686



2161 TGCACCTAAAAACGTGTCTGATATCATTCCTAGAACTGAAGTTGAAAAGGCCATCAGGAT 2220

687 APKNVSDIIPRTEVEKAIRM 706

2221 GTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGG 2280

707 SRSRINDAFRLNDNSLEFLG 726 

2281 GATACAGCCAACACTTGGACCTCCTAACCAGCCCCCTGTTTCCATATGGCTGATTGTTTT 2340

727 IQPTLGPPNQPPVSIWLIVF 746 

2341 TGGAGTTGTGATGGGAGTGATAGTGGTTGGCATTGTCATCCTGATCTTCACTGGGATCAG 2400 

747 GVVMGVIVVGIVILIFTGIR 766 

2401 AGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAGAAAATCCTTATGCCTCCATCGATAT 2460 

767 DRKKKNKARSGENPYASIDI 786



2461 TAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAGACCTCCTTTTA 2520
787 SKGENNPGFQNTDDVQTSF* 806 

2521 GAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGTTGTATGTAAATGTTAATTTCATGG 2580



2581 TATAGAAAATATAAGATGATAAAGATATCATTAAATGTCAAAACTATGACTCTGTTCAGA 2640



Figure 37D 

2641 AAAAAAATTGTCCAAAGACAACATGGCCAAGGAGAGAGCATCTTCATTGACATTGCTTTC 2700
2701 AGTATTTATTTCTGTCTCTGGATTTGACTTCTGTTCTGTTTCTTAATAAGGATTTTGTAT 2760
2761 TAGAGTATATTAGGGAAAGTGTGTATTTGGTCTCACAGGCTGTTCAGGGATAATCTAAAT 2820
2821 GTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGATATATCATTGGAGCAAGTGTTG 2880
2881 GATCTTGTATGGAATATGGATGGATCACTTGTAAGGACAGTGCCTGGGAACTGGTGTAGC 2940
2941 TGCAAGGATTGAGAATGGCATGCATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGA 3000
3001 CATGCTTTCTTCACAGTAACTCAGTTCAAGTACTATGGTGATTTGCCTACAGTGATGTTT 3060
3061 GGAATCGATCATGCTTTCTTCAAGGTGACAGGTCTAAAGAGAGAAGAATCCAGGGAACAG 3120
3121 GTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAACATCTCCCTGACAACACAA 3180
3181 AACTAGAGCCAGGGGCCTCCGTGAACTCCCAGAGCATGCCTGATAGAAACTCATTTCTAC 3240
3241 TGTTCTCTAACTGTGGAGTGAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGT 3300
3301 ACCCAGTCTCTTAAATCTTTTGTATTTGCTCACAGTGTTTGAGCAGTGCTGAGCACAAAG 3360
3361 CAGACACTCAATAAATGCTAGATTTACACACTCAAAAAAAAAAAA 3405
Figure 38A

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTN GTGAGGNCTC CGTGCCCACC 60 
1MFSP WKI SMF LSVX EXS VPT 20 



61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120
21TASF SAD MLN VTLQ GPT LNG 40 



121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180
41TFAQ SKC PQV EWLG WLN TIQ 60 



181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61PPFL WVL FVL ATLE NIF VLS 80 



241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
81 VFCL HKS SCT VAEI YLG NLA 100



301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360
101 AADL ILA CGL PFWA ITI SNN 120



361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420
121 FDWL FGE TLC RVVN AII SMN 140



421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141 LYSS ICF LML VSID RYL ALV 160



481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
161 KTMS MGR MRG VRWA KLY SLV 180 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181 IWGC TLL LSS PMLV FRT MKE 200 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201 YSDE GHN VTA CVIS YPS LIW 220 



661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
221 EVFT NML LNV VGFL LPL SVI 240 



721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780

241 TFCT MQI MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840

261 EIQT ERR ATV LVLV VLL LFI 280



841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ IST FLDT LHR LGI 300
901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GANGTAATCA CACAGATCGC CTCCTTCATG 960
301 LSSC QDE RII DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321 AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG NGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKX GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140
361 IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200
381 HKLQ DWA GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260
1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320
1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380
1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440
1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560
1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 
1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 
1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 
1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980
1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 
Figure 38C

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240
Figure 38D

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540
3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720
3721 AAAAAAAAAA AAA 3733 
Figure 41A



1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60 



61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120



121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180



181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300 
1 MA RAHW GCC PWL 12



301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360
13 VLLC ACA WGH TKPL DLG GQD 32



361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420
33 VRNC STN PPY LPVT VVN TTM 52



421 TCACTCACAG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480 

53SLTA LRQ QMQ TQNL SAY IIP 72 

481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG ACGAGAGGCG TGCGTGGATT 540 

73GTDA HMN EYI GQHD ERR AWI 92 

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600 

93TGFT GSA GTA VVTM KKA AVW 112 



601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660 
113 TDSR YWT QAE RQMD CNW ELH 132 



661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC TGCTGGAGGG 720
133 KEVG TTP IVT WLLT EIP AGG 152 



721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780 
153 RVGF DPF LLS IDTW ESY DLA 172 



781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 

173 LQGS NRQ LVS ITTN LVD LVW 192 

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900 

193 GSER PPV PNQ PIYA LQE AFT 212 

901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 

213 GSTW QEK VSG VRSQ MQK HQK 232 
Figure 41B



961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020
233 VPTA VLL SAL EETA WLF NLR 252 



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080
253 ASDI PYN PFF YSYT LLT DSS 272 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 IRLF ANK SRF SSET LSY LNS 292 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 1200
293 SCTG PMC VQI EDYS QVR DSI 312 



1201 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 1260

313QAYS LGD VRI WIGT SYT MYG 332 

1261 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 1320

333 IYEM IPR EKL VTDT YSP VMM 352 



1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380
353 TKAV KNS KEQ ALLK ASH VRD 372 



1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440

373 AVAV IRY LVW LEKN VPK GTV 392 

1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500

393 DEFS GAE IVD KFRG EEQ FSS 412 



1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560 
413 GPSF ETI SAS GLNA ALA HYS 432



1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620
433 PTKE LNR KLS SDEM YLL DSG 452 



1621 GGGCAGTACT GGGACGGGAC CACAGACATC ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 

453 GQYW DGT TDI TRTV HWG TPS 472 

1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 

473 AFQK EAY TRV LIGN IDL SRL 492 

1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 

493 IFPA ATS GRM VEAF ARR ALW 512 



1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 
1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980
553 SIEP GYY KDG EFGI RLE DVA 572

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2040
573 LVVE AKT KYP GELP DLV VSF 592 

2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCCGAGCA TCTCCAGTAC 2100 
593 VPYD RNL IDV SLLS PEH LQY 612 

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 
613 LNRY YQT IRE KVGP ELQ RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220 
633 LLEE FEW LQQ HTEP LAA RAP 652

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 2280 
653 DTAS WAS VLV VSTL AIL GWS 672 

2281 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2340
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 

2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460
2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520
2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 2580
2581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640
2641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700
2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760
2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820
2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880
2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940 
Figure 41D

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060
3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360
3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420
3421 GCCTGTGG 3428 
Figure 42A

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 

1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120

19 FPQN ATA CDN APEA WDL LHR 38

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180

39 VLPT FII SIC FFGL LGN LFV 58



181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59 LLVF LLP RRQ LNVA EIY LAN 78

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300
79 LAAS DLV FVL GLPF WAE NIW 98

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360
99 NQFN WPF GAL LCRV ING VIK 118

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420
119 ANLF ISI FLV VAIS QDR YRV 138



421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 

139 LVHP MAS GRQ QRRR QAR VTC 158

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 

159 VLIW VVG GLL SIPT FLL RSI 178



541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 

179 QAVP DLN ITA CILL LPH EAW 198

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 

199 HFAR IVE LNI LGFL LPL AAI 218



661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720
219 VFFN YHI LAS LRTR EEV SRT 238



721 AGAGTGCAGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780

239 RVQG PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 

259 FLVC WAP YHF FAFL EFL FQV 278 



841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 QAVR GCF WED FIDL GLQ LAN 298 
Figure 42B

901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960
299 FFAF TNS SLN PVIY VFV GRL 318

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020
319 FRTK VWE LYK QCTP KSL API 338 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE IFQ LFWR N * 353



1081 CC 1082 
Figure 43A

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGCCTC CGTGCCCACC 60 
1 MFSP WKI SMF LSVR EAS VPT 20

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120
21 TASF SAD MLN VTLQ GPT LNG 40



121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180
41 TFAQ SKC PQV EWLG WLN TIQ 60



181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61 PPFL WVL FVL ATLE NIF VLS 80



241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300
81 VFCL HKS SCT VAEI YLG NLA 100



301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360
101 AADL ILA CGL PFWA ITI SNN 120



361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420
121 FDWL FGE TLC RVVN AII SMN 140



421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 

141 LYSS ICF LML VSID RYL ALV 160

481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 

161 KTMS MGR MRG VRWA KLY SLV 180 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 

181 IWGC TLL LSS PMLV FRT MKE 200 



601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201 YSDE GHN VTA CVIS YPS LIW 220 



661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720
221 EVFT NML LNV VGFL LPL SVI 240 



721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780 
241 TFCT MQI MQV LRNN EMQ KFK 260 



781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840
261 EIQT ERR ATV LVLV VLL LFI 280



841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900
281 ICWL PFQ IST FLDT LHR LGI 300
901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960
301 LSSC QDE RII DVIT QIA SFM 320

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321 AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKG GCR SEP 360

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140
361 IQME NSM GTL RTSI SVE RQI 380

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320
1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440
1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 
1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 
1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920
1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040
Figure 43C

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240
Figure 43D

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300

3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540
3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720 
3721 AAAAAAAAAA AAA 3733 
Figure 44A

1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60

1 MHLIDYLLLLLVGLLALSHG 20

61 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120

21 QLHVEHDGESCSNSSHQQIL 40



121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 
41 ETGEGSPSLKIAPANADFAF 60



181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240
61 RFYYLIASETPGKNIFFSPL 80



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300
81 SISAAYAMLSLGACSHSRSQ 100



301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360
101 ILEGLGFNLTELSESDVHRG 120



361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420

121 FQHLLHTLNLPGHGLETRVG 140 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 
161 AVYEAKLFHTNFYDTVGTIQ 180 



541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 
181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660
201 LKKDVLMVLVNYIYFKALWE 220 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 720
221 KPFISSRTTPKDFYVDENTT 240 



721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780
241 VRVPMMLQDQEHHWYLHDRY 260 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 LPCSVLRMDYKGDATVFFIL 280 
Figure 44B

841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960

301 WNNLLRKRNFYKKLELHLPK 320 



961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020
321 FSISGSYVLDQILPRLGFTD 340 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACGACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200

381 TTFAIKFFSAQTNRHILRFN 400 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260

401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284 
421 VVDPTKP* 428 






Figure 45A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60
1 MHLIDYLLLLLVGLLALSHG 20



61 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120

21 QLHVEHDGESCSNSSHQQIL 40

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180

41 ETGEGSPSLKIAPANADFAF 60 



181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 
61 RFYYLIASETPGKNIFFSPL 80



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300

81 SISAAYAMLSLGACSHSRSQ 100

301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360

101 ILEGLGFNLTELSESDVHRG 120 

361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACATGCGTGGGC 420

121 FQHLLHTLNLPGHGLETCVG 140 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480

141 SALFLSHNLKFLAKFLNDTM 160 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 
161 AVYEAKLFHTNFYDTVGTIQ 180 



541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 
181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 
201 LKKDVLMVLVNYIYFKALWE 220 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 720
221 KPFISSRTTPKDFYVDENTT 240 



721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780
241 VRVPMMLQDQEHHWYLHDRY 260 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 LPCSVLRMDYKGDATVFFIL 280
Figure 45B



841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960

301 WNNLLRKRNFYKKLELHLPK 320 



961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020
321 FSISGSYVLDQILPRLGFTD 340 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 



1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200
381 TTFAIKFFSAQTNRHILRFN 400 



1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260
401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284
421 VVDPTKP* 428 
Figure 46A



1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60 



61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120



121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180 



181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300
1 MA RAHW GCC PWL 12

301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360
13 VLLC ACA WGH TKPL DLG GQD 32

361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420
33 VRNC STN PPY LPVT VVN TTM 52



421 TCACTCACAG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480 
53 SLTA LRQ QMQ TQNL SAY IIP 72



481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG ACGAGAGGCG TGCGTGGATT 540 

73 GTDA HMN EYI GQHD ERR AWI 92

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600 

93 TGFT GSA GTA VVTM KKA AVW 112

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660 

113 TDSR YWT QAE RQMD CNW ELH 132 

661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC NGCTGGAGGG 720

133 KEVG TTP IVT WLLT EIP AGG 152 



721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780 
153 RVGF DPF LLS IDTW ESY DLA 172 



781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 
173 LQGS NRQ LVS ITTN LVD LVW 192



841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900 

193 GSER PPV PNQ PIYA LQE AFT 212

901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960

213 GSTW QEK VSG VRSQ MQK HQK 232
Figure 46B



961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020 
233 VPTA VLL SAL EETA WLF NLR 252



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
253 ASDI PYN PFF YSYT LLT DSS 272 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 IRLF ANK SRF SSET LSY LNS 292 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 1200
293 SCTG PMC VQI EDYS QVR DSI 312 



1201 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 1260
313 QAYS LGD VRI WIGT SYT MYG 332 



1261 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 1320
333 IYEM IPR EKL VTDT YSP VMM 352



1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380

353 TKAV KNS KEQ ALLK ASH VRD 372 

1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440

373 AVAV IRY LVW LEKN VPK GTV 392

1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500

393 DEFS GAE IVD KFRG EEQ FSS 412 



1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560
413 GPSF ETI SAS GLNA ALA HYS 432 



1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620 
433 PTKE LNR KLS SDEM YLL DSG 452 



1621 GGGCAGTACT GGGACGGGAC CACAGACATC ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680
453 GQYW DGT TDI TRTV HWG TPS 472 



1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740
473 AFQK EAY TRV LIGN IDL SRL 492 



1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 
493 IFPA ATS GRM VEAF ARR ALW 512 

1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860
513 DAGL NYG HGT GHGI GNF LCV 532 
1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920
533 HEWP VGF QSN NIAM AKG MFT 552 



2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCNGAGCA TCTCCAGTAC 2100 
593 VPYD RNL IDV SLLS PEH LQY 612 

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 
613 LNRY YQT IRE KVGP ELQ RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220
633 LLEE FEW LQQ HTEP LAA RAP 652 



2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 
2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 
2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520 
2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 2580
2581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640 
2641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700
2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760
2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820
2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880
2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940 
Figure 46D

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060
3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360
3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420
3421 GCCTGTGG 3428 



